173 research outputs found
FASTCUDA: Open Source FPGA Accelerator & Hardware-Software Codesign Toolset for CUDA Kernels
Using FPGAs as hardware accelerators that communicate with a central CPU is becoming a common practice in the embedded design world but there is no standard methodology and toolset to facilitate this path yet. On the other hand, languages such as CUDA and OpenCL provide standard development environments for Graphical Processing Unit (GPU) programming. FASTCUDA is a platform that provides the necessary software toolset, hardware architecture, and design methodology to efficiently adapt the CUDA approach into a new FPGA design flow. With FASTCUDA, the CUDA kernels of a CUDA-based application are partitioned into two groups with minimal user intervention: those that are compiled and executed in parallel software, and those that are synthesized and implemented in hardware. A modern low power FPGA can provide the processing power (via numerous embedded micro-CPUs) and the logic capacity for both the software and hardware implementations of the CUDA kernels. This paper describes the system requirements and the architectural decisions behind the FASTCUDA approach
Sqrt{shat}_{min} resurrected
We discuss the use of the variable sqrt{shat}_{min}, which has been proposed
in order to measure the hard scale of a multi parton final state event using
inclusive quantities only, on a SUSY data sample for a 14 TeV LHC. In its
original version, where this variable was proposed on calorimeter level, the
direct correlation to the hard scattering scale does not survive when effects
from soft physics are taken into account. We here show that when using
reconstructed objects instead of calorimeter energy and momenta as input, we
manage to actually recover this correlation for the parameter point considered
here. We furthermore discuss the effect of including W + jets and t tbar+jets
background in our analysis and the use of sqrt{shat}_{min} for the suppression
of SM induced background in new physics searches.Comment: 23 pages, 9 figures; v2: 1 figure, several subsections and references
as well as new author affiliation added. Corresponds to published versio
Queue Management in Network Processors
Abstract: -One of the main bottlenecks when designing a network processing system is very often its memory subsystem. This is mainly due to the state-of-the-art network links operating at very high speeds and to the fact that in order to support advanced Quality of Service (QoS), a large number of independent queues is desirable. In this paper we analyze the performance bottlenecks of various data memory managers integrated in typical Network Processing Units (NPUs). We expose the performance limitations of software implementations utilizing the RISC processing cores typically found in most NPU architectures and we identify the requirements for hardware assisted memory management in order to achieve wire-speed operation at gigabit per second rates. Furthermore, we describe the architecture and performance of a hardware memory manager that fulfills those requirements. This memory manager, although it is implemented in a reconfigurable technology, it can provide up to 6.2Gbps of aggregate throughput, while handling 32K independent queues
Forward Jets and Energy Flow in Hadronic Collisions
We observe that at the Large Hadron Collider, using forward + central
detectors, it becomes possible for the first time to carry out calorimetric
measurements of the transverse energy flow due to "minijets" accompanying
production of two jets separated by a large rapidity interval. We present
parton-shower calculations of energy flow observables in a high-energy
factorized Monte Carlo framework, designed to take into account QCD logarithmic
corrections both in the large rapidity interval and in the hard transverse
momentum. Considering events with a forward and a central jet, we examine the
energy flow in the interjet region and in the region away from the jets. We
discuss the role of these observables to analyze multiple parton collision
effects.Comment: 9 pages, 5 figures. Version2: added results on azimuthal
distributions and more discussion of energy flow definition using jet
clusterin
FASTER: Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration
The FASTER (Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration) EU FP7 project, aims to ease the design and implementation of dynamically changing hardware systems. Our motivation stems from the promise reconfigurable systems hold for achieving high performance and extending product functionality and lifetime via the addition of new features that operate at hardware speed. However, designing a changing hardware system is both challenging and time-consuming. FASTER facilitates the use of reconfigurable technology by providing a complete methodology enabling designers to easily specify, analyze, implement and verify applications on platforms with general-purpose processors and acceleration modules implemented in the latest reconfigurable technology. Our tool-chain supports both coarse- and fine-grain FPGA reconfiguration, while during execution a flexible run-time system manages the reconfigurable resources. We target three applications from different domains. We explore the way each application benefits from reconfiguration, and then we asses them and the FASTER tools, in terms of performance, area consumption and accuracy of analysis
Vitamin-V: Virtual Environment and Tool-boxing for Trustworthy Development of RISC-V based Cloud Services
Vitamin-V is a 2023-2025 Horizon Europe project that aims to develop a complete RISC-V open-source software stack for cloud services with comparable performance to the cloud-dominant x86 counterpart and a powerful virtual execution environment for software development, validation, verification, and test that considers the relevant RISC-V ISA extensions for cloud deployment
RECO level \sqrt{s}_{min} and subsystem \sqrt{s}_{min}: improved global inclusive variables for measuring the new physics mass scale in missing energy events at hadron colliders
The variable \sqrt{s}_{min} was originally proposed in arXiv:0812.1042 as a
model-independent, global and fully inclusive measure of the new physics mass
scale in missing energy events at hadron colliders. In the original incarnation
of \sqrt{s}_{min}, however, the connection to the new physics mass scale was
blurred by the effects of the underlying event, most notably initial state
radiation and multiple parton interactions. In this paper we advertize two
improved variants of the \sqrt{s}_{min} variable, which overcome this problem.
First we show that by evaluating the \sqrt{s}_{min} variable at the RECO level,
in terms of the reconstructed objects in the event, the effects from the
underlying event are significantly diminished and the nice correlation between
the peak in the \sqrt{s}_{min}^{(reco)} distribution and the new physics mass
scale is restored. Secondly, the underlying event problem can be avoided
altogether when the \sqrt{s}_{min} concept is applied to a subsystem of the
event which does not involve any QCD jets. We supply an analytic formula for
the resulting subsystem \sqrt{s}_{min}^{(sub)} variable and show that its peak
exhibits the usual correlation with the mass scale of the particles produced in
the subsystem. Finally, we contrast \sqrt{s}_{min} to other popular inclusive
variables such as H_T, M_{Tgen} and M_{TTgen}. We illustrate our discussion
with several examples from supersymmetry, and with dilepton events from top
quark pair production.Comment: 41 pages, 26 figure
- …